A Composite Kernel Approach for Detecting Interactive Segments in Chinese Topic Documents
Abstract
Discovering the interactions between persons mentioned in a set of topic documents can help readers construct the background of a topic and facilitate comprehension. In this paper, we propose a rich interactive tree structure to represent syntactic, content, and semantic information in text. We also present a composite kernel classification method that integrates the tree structure with a bigram kernel to identify text segments that mention person interactions in topic documents. Empirical evaluations demonstrate that the proposed tree structure and bigram kernel are effective and the composite kernel approach outperforms well-known relation extraction and PPI methods.
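As a rough illustration of the composite kernel idea mentioned in the abstract, the sketch below combines two normalized kernels with a convex weight. It is not the authors' implementation: the function names, the weight `alpha`, and the normalization step are assumptions, and `unigram_kernel` is only a placeholder standing in for the paper's tree kernel, which is not specified here.

```python
from collections import Counter


def bigram_kernel(a, b):
    """Dot product of the bigram count vectors of two token sequences."""
    ca = Counter(zip(a, a[1:]))
    cb = Counter(zip(b, b[1:]))
    return float(sum(n * cb[g] for g, n in ca.items()))


def unigram_kernel(a, b):
    """Placeholder (assumption): stands in for the tree kernel of the paper."""
    ca, cb = Counter(a), Counter(b)
    return float(sum(n * cb[t] for t, n in ca.items()))


def normalized(kernel):
    """Wrap a kernel so that k(x, x) == 1 whenever kernel(x, x) > 0."""
    def k(a, b):
        denom = (kernel(a, a) * kernel(b, b)) ** 0.5
        return kernel(a, b) / denom if denom else 0.0
    return k


def composite_kernel(k1, k2, alpha=0.5):
    """Convex combination alpha * k1 + (1 - alpha) * k2 (alpha is hypothetical)."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")

    def k(a, b):
        return alpha * k1(a, b) + (1.0 - alpha) * k2(a, b)
    return k


# Two toy segments, already tokenized.
seg_a = ["A", "met", "B"]
seg_b = ["A", "met", "C"]
kernel = composite_kernel(normalized(bigram_kernel), normalized(unigram_kernel))
similarity = kernel(seg_a, seg_b)
```

A weighted sum of valid kernels with non-negative weights is itself a valid kernel, so the result could in principle be passed as a precomputed Gram matrix to a kernel classifier such as an SVM.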
Similar resources
Classifying Attitude by Topic Aspect for English and Chinese Document Collections
Title: Classifying Attitude by Topic Aspect for English and Chinese Document Collections. Yejun Wu, Doctor of Philosophy, 2008. Dissertation directed by: Professor Douglas W. Oard, College of Information Studies & Institute for Advanced Computer Studies, UMCP. The goal of this dissertation is to explore the design of tools to help users make sense of subjective information in English and Chinese by...
Composite Kernel Optimization in Semi-Supervised Metric
Machine-learning solutions to classification, clustering and matching problems critically depend on the adopted metric, which in the past was selected heuristically. In the last decade, it has been demonstrated that an appropriate metric can be learnt from data, resulting in superior performance as compared with traditional metrics. This has recently stimulated a considerable interest in the to...
FISER: An Effective Method for Detecting Interactions between Topic Persons
Discovering the interactions between the persons mentioned in a set of topic documents can help readers construct the background of the topic and facilitate document comprehension. To discover person interactions, we need a detection method that can identify text segments containing information about the interactions. Information extraction algorithms then analyze the segments to extract intera...
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain Knowledge from Wikipedia
Dialog topic tracking aims at analyzing and maintaining topic transitions in ongoing dialogs. This paper proposes a composite kernel approach for dialog topic tracking to utilize various types of domain knowledge obtained from Wikipedia. Two kernels are defined based on history sequences and context trees constructed based on the extracted features. The experimental results show that our composi...
Application of Radon Transform in Detecting Turning Angle of Bodies and in Reading Multi-Lingual Documents
Recently, image processing techniques and robotic vision have been widely applied in fault detection of industrial products as well as in document reading. In order to compare the images captured from the target, it is necessary to prepare a perfect image and then apply matching. Preprocessing must therefore be done to correct the sample's and/or camera's movement, which can occur during the...